Deep Learning of Hierarchical Structure

نویسنده

  • Chris Manning
چکیده

Hierarchical and recursive structure is commonly found in inputs from the richest sensory modalities, including natural language sentences and scene images. But such hierarchical structure has traditionally been a strong point of both structured and supervised models (whether symbolic of probabilistic) and a weak point of both neural networks and unsupervised learning. I will present some of our recent work on a recursive neural network architecture that tries to address these issues. The system was initially developed to jointly parse natural language sentences and to learn syntacticosemantic vector representations of phrases. The system is based on a context-aware recursive neural network (RNN), which can both find a globally optimal parse tree with dynamic programming and induce distributed phrase representations for unseen, variable-sized inputs. The induced representations capture interesting semantic similarities and can be used successfully as features for finding syntactically and semantically plausible paraphrases. For instance, the phrases ”declined to comment” and ”would not disclose” are close by in the induced embedding space. However, the same approach can also be applied to find hierarchical structure in images. Discovering this hierarchical structure helps us to not just identify the units that an image contains but also how they interact to form a whole. We introduce a max-margin structure prediction architecture based on a recursive neural network that can successfully recover such structure from complex scene images. For semantic scene segmentation, annotation and classification, this algorithm obtains a new level of state-of-the-art performance for segmentation on the Stanford background dataset (78.1%), outperforming Gist descriptors for scene classification by 4%. The recursive neural network parser is trained on supervised data. I finally discuss extending this work in a less supervised direction by exploring using recursive autoencoders for predicting sentence-level sentiment distributions. The model captures compositional semantics and learns phrase feature representations sufficiently well as to outperform more traditional supervised sentiment classification methods operating on flat bags of features, even without the use of any predefined sentiment lexica or polarity shifting rules. This talk presents joint work with Richard Socher and Andrew Ng.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Hybrid Optimization Algorithm for Learning Deep Models

Deep learning is one of the subsets of machine learning that is widely used in Artificial Intelligence (AI) field such as natural language processing and machine vision. The learning algorithms require optimization in multiple aspects. Generally, model-based inferences need to solve an optimized problem. In deep learning, the most important problem that can be solved by optimization is neural n...

متن کامل

A Hybrid Optimization Algorithm for Learning Deep Models

Deep learning is one of the subsets of machine learning that is widely used in Artificial Intelligence (AI) field such as natural language processing and machine vision. The learning algorithms require optimization in multiple aspects. Generally, model-based inferences need to solve an optimized problem. In deep learning, the most important problem that can be solved by optimization is neural n...

متن کامل

Learning Hierarchical Features from Deep Generative Models

Deep neural networks have been shown to be very successful at learning feature hierarchies in supervised learning tasks. Generative models, on the other hand, have benefited less from hierarchical models with multiple layers of latent variables. In this paper, we prove that hierarchical latent variable models do not take advantage of the hierarchical structure when trained with some existing va...

متن کامل

Detecting Overlapping Communities in Social Networks using Deep Learning

In network analysis, a community is typically considered of as a group of nodes with a great density of edges among themselves and a low density of edges relative to other network parts. Detecting a community structure is important in any network analysis task, especially for revealing patterns between specified nodes. There is a variety of approaches presented in the literature for overlapping...

متن کامل

Learning Hierarchical Features from Generative Models

Deep neural networks have been shown to be very successful at learning feature hierarchies in supervised learning tasks. Generative models, on the other hand, have benefited less from hierarchical models with multiple layers of latent variables. In this paper, we prove that hierarchical latent variable models do not take advantage of the hierarchical structure when trained with existing variati...

متن کامل

Reinforcement Learning with Deep Architectures

There is both theoretical and empirical evidence that deep architectures may be more appropriate than shallow architectures for learning functions which exhibit hierarchical structure, and which can represent high level abstractions. An important development in machine learning research in the past few years has been a collection of algorithms that can train various deep architectures effective...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011